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Abstract. A random balanced sample (RBS) is a multivariate distribution 
with n components Xi, . . . , X n , each uniformly distributed on [—1,1], such 
that the sum of these components is precisely 0. The corresponding vectors 
X lie in an (n — 1)— dimensional polytope M(n). We present new methods 
for the construction of such RBS via densities over M (n) and these apply 
for arbitrary n. While simple densities had been known previously for small 
values of n (namely 2, 3 and 4), for larger n the known distributions with large 
support were fractal distributions (with fractal dimension asymptotic to n as 
n — » oo). Applications of RBS distributions include sampling with antithetic 
coupling to reduce variance, and the isolation of nonlincaritics. We also show 
that the previously known densities (for n < 4) are in fact the only solutions in 
a natural and very large class of potential RBS densities. This finding clarifies 
the need for new methods, such as those presented here. 



1. Introduction 

The aim of this article is to improve our understanding of "random balanced 
samples" (RBS), multivariate distributions having specified marginals as well as 
a "balanced" property; the precise definition is given below. Random balanced 
samples have found several applications, described in the literature, but the con- 
struction of nontrivial RBS distributions has presented interesting mathematical 
challenges. We describe new methods that generate RBS densities with respect to 
the underlying Lebesgue measure. In contrast, previous work on this problem has 
relied on fractal geometry and iterated function systems. Ironically, the earliest 
examples of RBS distributions were given by simple explicit densities. In Sectional 
however, we show that those earlier techniques cannot be extended to higher di- 
mensions. 

Definition 1. A random balanced sample (RBS) of size n is a system of random 
variables 

X = (Xi,X2, ■ ■ ■ , X n ) 

such that X is balanced, ie 

n 

fc=l 
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and each X k is uniformly distributed over [— 1, 1] . Other marginal distributions 
might be considered, but we focus here on the uniform case. 

Our main concern here is to present new methods for constructing RBS densities 
(see section 2), and to point out the severe limitations of older methods (see section 
3). Nevertheless, to set the context we briefly review some situations where RBS 
distributions may be usefully applied. 

A classical technique for reducing Monte Carlo variance is the use of "antithetic 
variates". First introduced by Hammersley and Morton in 1956 pQ, it has been ex- 
tended to larger groupings of variables by Arvidsen and Johnsson [2] , and applied 
to the bootstrap by P. Hall For more recent applications, see the papers of 
Craiu and Meng ^ |SJ E| • In the following example we motivate the use of anti- 
thetic variates, and we show that in the exchangeable case, RBS distributions are 
extremely antithetic. That is, the variables are as negatively correlated as possible. 

Example 2. We may wish to estimate the mean 1 fix) dx of an unknown 
function f : [— 1, 1] — > R by means of 



fn 



1 



k=l 



f{x k ). 



The variance of f n will be reduced if cov(f(Xk), f(Xj)) < for k ^ j. This reduc- 
tion can be achieved for a variety of functions f by means of antithetic coupling of 
the sampling variables Xk, ie by insisting that cov(Xk, Xj) be negative for k j. 
Note that for X k uniformly distributed over [—1,1] we have 



E 



fe=i 



n 
3 



Y^E{X k X, 



so that if cov(Xfc, Xj) = a for all k ^ j (eg in the exchangeable case) then 



n{n — 1) 



E 



k=l 



Xi 




Thus the minimum value of a is — 3 („ 1 _ 1 ) , achieved precisely when we have a RBS. 

Example 3. In the same setting as Example the estimate f n via a RBS al- 
lows a cleaner distinction between the linear and nonlinear parts of f . Indeed, if 
f{x) — Cx + g(x) where g is nonlinear then a RBS entirely eliminates the linear 
component Cx (even though the constant C may not be known) from the estimate 
f n . Applications of this nature are discussed in Gerow et al. |7l 151 151 110|.' see 
also |11| . 

The structure of RBS distributions is by no means a simple matter; there is 
a bewildering variety of RBS distributions for sample size n > 3. Each may be 
regarded as a probability distribution on the regular polytope 



M(n) = I x € [-1, 1]™ : ^ x k = 



fc=i 



with the additional requirement of marginal uniformity. For small values of n, the 
polytopes M(n) are familiar geometric objects; for example, one easily sees that 
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Figure 1 . The Robson density graphed as a piecewise linear den- 
sity over M(3), represented as a regular hexagon. Gerow discov- 
ered that a similar construction (density proportional to max \ Xk\) 
works on M(4). 



M(3) is the regular hexagon with vertices (±1, =pl, 0), (±1, 0, =Fl), and (0, ±1, =Fl). 
It seems that the earliest solution to this puzzle (aside from the trivial case where 
n = 2) is due to D. Robson. He noted that a simple piecewise linear density on 
M(3) meets the requirements. See Figure ^ below. K. Gerow, in [23J, extended 
Robson's method to n = 4, and detected problems for this method in case n > 4. 
As it turned out, even a broad generalization of the Gerow-Robson method could 
not produce RBS densities for n > 5. This phenomenon is studied in detail in ^ 
below. 

As alternatives to the Robson density on M(3), many other RBS distributions 
were constructed; part of [5] is devoted to a survey of these, many of which have 
attractive geometric features. In particular, certain fractal measures define RBS 
distributions for arbitrary sample size n. See Figures |21 and |3] below. While such 
fractal measures cannot have all of M(n) as their support, the fractal dimension of 
their supports can be made large with respect to the topological dimension n — 1 
of M(n) (see |H| F° r some time, there appeared to be a dichotomy in the 

possible RBS constructions: for small n we had simple RBS densities, but for 
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Figure 2. A fractal "superstar" RBS distribution on the hexagon 
M(3). The histogram generated by this simulation is shown at the 
bottom, visually verifying the uniform distribution of Xj,- 

larger n only singular RBS distributions (the fractal constructions, which produce 
measures singular with respect to the natural (n — l)-volume on M(n)). 

More recently however, we have found constructions that yield RBS densities 
for any sample size n. These are described in |21below. They introduce a technique 
of redistribution that may have other applications as well. Like the earlier frac- 
tal constructions (which can be implemented via iterated function systems), these 
new methods are algorithmically effective. That is, workers in the field can easily 
generate sampling vectors X by following the procedures outlined in |J2| 



2. Optimal densities 
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Figure 3. As in Figured a fractal RBS may be generated in the 
octahedron M (4) by means of an IFS (iterated function system) 
mapping all of M(n) closer to its various vertices (6 in this case). 



2.1. Goals. Given a RBS, it may be optimized in several directions, and these 
goals are to some degree incompatible. We may wish, for example, to achieve as 
much mutual independence as possible among the coordinate random variables Xk- 
In this way we would stay near the more familiar situation of i.i.d. sampling. It 
is clear, however, that for a given value of n not more than n/2 of the Xk can be 
mutually independent. Suppose, for example, that X\, X2, ■ ■ ■ , X m are mutually 
independent; then there is a positive probability (namely (e/2) m , if e > is small) 
that 

(1) X k e[l-e,l] (k = l,...,m). 
In this case we would have 

m n 

(2) m(l-e)<J2 X k = - x k<{n-m), 

k—l k— m+1 

which is not possible (for sufficiently small e) if 2m > n. On the other hand, if 
we compromise in other directions, the maximal mutual independence is easy to 
obtain. Consider, for example, an even sample size n = 2m and a RBS defined by 
choosing Xi,X%, . . . , X m independently (and each Xk uniform in [—1,1], of course), 
then setting 

(3) X m+k = -X k (k = 1, . . . , m). 

Here we have the coordinates partitioned into two subsets of m — n/2 mutually 
independent random variables (for example, the first m and the last m of the 
Xk)- This RBS, however, is degenerate in the sense that the distribution of X is 
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Figure 4. While the 4 dimensional M (5) cannot be easily visual- 
ized, each of its faces is similar to the elegant 3-dimensional object 
seen here. It may be regarded as a truncated tetrahedron; the 
result has 4 regular hexagons and 4 equilateral triangles as its faces. 

supported on a (n/2)-dimcnsional subset of the natural range M(n) for RBS of 
size n. Recall that 



so that M(n) has the much larger dimension n— 1. A sampling procedure based on 
such a RBS lacks robustness in a certain sense: it may overlook significant struc- 
ture in the distribution under investigation because that structure lies outside the 
support of the RBS. Following this line of thought, the dimension of the support 
of a RBS distribution has been viewed as a measure of robustness; in some inter- 
esting cases this dimension must be computed as a fractal dimension (see [8], for 
example; a fractal construction is also displayed in In W2.2\ we shall see how 
the degenerate RBS defined by © can be modified to achieve maximal robustness 
(n — 1) without much loss in mutual independence of the coordinates. 

A natural (n — 1) -dimensional model for M(n) is obtained by first constructing 
n unit vectors u\, v>2, ■ ■ • , u n in R™ -1 such that (v,i,Uj) = — l/(n — 1) whenever 
i ^ j. Then M(n) may be identified with {v € M™ _1 : (Vfc) - 1 < (v,u k ) < 1}. 
Corresponding to each such v we have X with X k = (v,Uk) (k = 1,2, ... ,n). 
See [S] for further details. These models of M(n) yield regular polytopes in K™ -1 . 
For example, M(3) is seen as a regular hexagon, as in Figures ^ an d [21 while M(4) 
is the regular octahedron outlined in Figure Figure 0] below presents one of the 
3-dimensional faces of M(5), which is itself a 4-dimensional object. 
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Along with independence and robustness, a third desirable feature of a RBS pro- 
cedure might be algorithmic efficiency, the efficiency with which sampling variables 
may be generated for use in experiments. The RBS defined by @, for example, 
is very efficient, since it involves very little beyond m calls to a random number 
generator to produce a sampling vector X. We shall see that the modifications 
proposed in W2. 21 retain most of this efficiency as they increase robustness. 



2.2. Redistribution. Here we illustrate, in a simple setting, a general procedure 
of redistribution that may be used to increase the robustness of a RBS. In this 
initial example, we confine ourselves to the redistribution of pairs of independent 
coordinates chosen from X. 

Lemma 4. Given independent X\,X2, each uniformly distributed on [—1,1], let 
S = X\ + X2 and define the new variables 

where T is uniform on [—1,1] and independent of the X^; then Yi,Y 2 are also 
independent and uniform on [—1,1]. 

Remark 5. Since Y± + Y2 = S = X\ + X2, we may think of the procedure in this 
lemma as redistributing the part of a RBS captured by X\ + X 2 . The geometry 
behind this construction is revealed by writing Y — (Yi,Y 2 ) = D + TQ where 
D = (S/2, S/2) is a point on the diagonal of [— 1, l] 2 and 

so that A = D + Q and B = D — Q are the endpoints of the segment in [—1, l] 2 
passing through D and perpendicular to the diagonal (see Figure^) . Thus Y is 
chosen in two steps: first D is chosen with a density proportional to the length 
of AB , then Y is placed at a point chosen uniformly along AB. This procedure 
strongly suggests that Y will be uniformly distributed over the square [— 1, l] 2 . We 
present a more formal proof below, computing the joint density of (Y\, Y 2 ). 



Proof. Let D = S/2. The joint density f(t,d) of (T, D) on [-1, l] 2 is given by 
1(1 — \d\), since T and D are independent and it is easy to compute the density of 



D on [-1, 1] as 1 



-\d\ 
(Yi,Y 2 



Also 



(1-\D\)T,D 



a 



\D\)T), 



a transformation with Jacobian matrix 



J(t, d) = 



(1 - Ml) 
-(l-|d|) 



1 — sign(d)i 
1 + sign(d)i 



(compute for the two cases d > and d < 0). It follows that det J(t, d) = 2(1 — |d|). 
The transformation is injective from [— 1, l] 2 to [— 1, l] 2 , so that we may compute 
(compare the discussion of "change of variable" in |18| . page 213) the joint density 
of (Yi,Y 2 ) at the image of (T,D) as f(t,d)/ det J(t, d) — 1/4, the uniform density 
on [—1, l] 2 . Thus Yi,Y 2 are uniform over [—1, 1] and are independent. □ □ 
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Figure 5. A graphical view of the redistribution algorithm for the 
case X x = 0.7, X 2 = -0.3. 

The following theorem shows how to redistribute selected pairs of coordinates 
from the degenerate RBS defined by J2J so that we obtain a RBS that is maximally 
robust. 

Theorem 6. Let X\, . . . , X m , Ti, . . . , T m be mutually independent random vari- 
ables, each uniform on [—1,1]. Assume to > 1 and let 

(6) S k = X k -X k+1 (k = l,...,m), 

with the understanding that X m +i = X\. Next (redistributing the coordinates com- 
bined in Sk as in the lemma) let 

(7) r 2fc _ 1 = f + (i-if)r fe , F 2fe = |-(i-ifi)T fe , 

for k = 1, . . . , to. Then Y defines a RBS of size n = 2m that is maximally robust, 
ie that has a set of dimension n — 1 as its support. Moreover, the distribution of Y 
is given by a density on M(n). 
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Proof. Since X k and —Xk+i are independent and uniform on [—1,1], Lemma 01 
(with appropriate change of notation) ensures that each coordinate 1 of Y is uniform 
on [-1,1]. Clearly Y k = £™ S k = 0, so that Y indeed defines a RBS. To 
see that it is maximally robust, note first that every y £ M(n) with sufficiently 
small \yk\ can occur as a value of Y. Indeed, if \yk\ < — for each k, we set (for 
k = 1,2, ... ,m) 

fc-i 

(8) Xh = -^2,r j , 

3 = 1 

where Tj — y 2 j-i + Vij- Note that x\ = x m +i — and that \x k \ < 2 ^ m n < 1- Wc 
have x k — x k+ \ = r k for each fc; for k — m this is a consequence of the fact that 
r k = Y^x Vk = 0- Solving for appropriate values of i*., we find that we require 
only that 

^ _ J/2fe-l — 2/2fc 

2 - is/2k— i +j/2*r 

It is easy to check that |i fc | < 1, simply because |j/2fc-i|, \y2k\ < h if V2k-i = U2k = 
±1, any value of t k will do. Thus, with X = x and T = t, we have 1" = y, as 
claimed. 

Next consider a neighborhood y(e) of such a y in M(n); we must show that the 
procedure of the theorem places sampling vectors in y (e) with positive probability. 
By the continuity of the procedure for obtaining Y from (X,T), there is a neigh- 
borhood u(6) of (x,z) in [—1, l] n that is mapped by the procedure into y(e). The 
probability of u(S) is already positive; indeed it's just the normalized n-volume of 
u(5), since the coordinates of (X,T) are chosen independently and uniformly in 
[-1,1]. 

To see that the distribution of Y is given by a density on M(n), we introduce 
the (linear) mapping L : [— 1, l] m — > 2M[m) defined by 

(9) L(x\, . . . ,x m ) = (xi - x 2 ,x 2 -x 3 ,...,x m - xi). 

The analysis above shows that y € M(n) occurs as a value of Y exactly when 
L(x) = r for some x £ [—1, l] m . Given e > 0, consider the map 

/ : (1 - e)M(n) 2M(m) x [-1, l] m 

defined by f(y) — (fi(y), fziy)), where fi(y) = r and /a(y) = The denominators 
in the expressions for the t k are bounded away from zero since 2 — | j/2fc— l + U2k\ ^ 
2 — 2(1 — e) = 2e, so that / is Lipschitz on (1 — e)M(n) for each fixed e. Given a Borel 
subset, B, of (1 — e)M(n), the probability assigned to B by the distribution of Y 
is the (normalized) n-volume (or Lebesgue measure) of {(x, i) : (L(x),t) € f(B)}. 
Since L is linear and [—1, 1]™ is bounded, this is at most a constant times the (n—1)— 
volume of f(B). As / is Lipschitz, this is in turn bounded by a constant times the 
(n — 1)— volume of B. Certainly then, the distribution is absolutely continuous with 
respect to normalized Lebesgue measure on (1 — e)M{n). Considering a sequence 
of e-values tending to 0, we see that the distribution is absolutely continuous on 
M(n) itself and so given by a density with respect to the normalized (n— 1)— volume 
onM(n). □ □ 



We remark that it may be of interest to assess the degree to which these coordinates are 
mutually independent. 



10 



PETER BUBENIK AND JOHN HOLBROOK 



2.3. Using all of M(n). The procedure of the last section retains algorithmic 
efficiency and it is robust in the sense of dimension, but in most cases the sampling 
values do not fill all of M(n). Here we show how to modify the construction to 
obtain RBS procedures that have all of M(n) as support. First let us clarify the 
reasons for the failure of the construction in H2.2I to cover M(n); we reuse the 
notations of the proof above. As y ranges over M(n) the corresponding r fills all of 
2M(m) (recall that n — 2m). The procedure will yield every y £ M(n) as a possible 
value exactly when 2M(m) = L([—l, l] m ). Since L([— 1, l] m ) is the convex hull of 
the images of the 2 m extreme points of [— 1, l] m , we can simply check whether these 
images include the extreme points of 2M(m), which are easy to identify (at most 
one coordinate can differ from ±2). 

For m = 2, for example, the extreme points of 2M(2) are 

(10) (2, -2) = L(l, -1) and (-2, 2) = L(-l, 1), 

so that the procedure of H2.2l for m = 2 gives a RBS distribution on M(4) having 
a density that is positive everywhere on the octahedron. One can check that this 
density is unbounded, in contrast to the piecewise linear density found by Gerow 
(see section 3, below). For m = 3 we again get a RBS distribution with all of M(6) 
as support, because the extreme points of 2M(3) are: 

(11) ±(2,-2,0) = L(±(l,-l,l)),±(0,2,-2) = L(±(l,l,-l)), etc. 

For m = 4, however, one can compute in a similar fashion that L{[— 1, 1] ) is 
a sort of rhombic dodecahedron lying strictly inside the octahedron 2M(4). See 
Figure below. Thus, except in a few simple cases, the procedure of M2.2I while it 
is maximally robust in terms of dimension, does not fill M(n). As we'll now see, 
the remedy is simply to symmetrize this procedure. 

Lemma 7. Given any w G M(n), there is a permutation a of {1, 2, . . . , n} such 
that for z = a(w) := (w a m, . . . , w a r n \) all partial sums 

k 

(12) J> 3 (k=l,...,n) 

.7=1 

lie in [-1,1]. 

Proof. We can choose the values of a(i) inductively. Suppose that (distinct) cr(l), . . . , 
have been chosen so that the partial sums (|12fl lie in [—1.1] for all k < i. Since 

i 

(13) w °{i) + W J = °' 

j=l j$<r({l,...,i}) 

we can choose <j(i + 1) such that has sign opposite from that of the first 

sum in l|13|) : it follows that the partial sum (|12fl lies in [—1, 1] for k = i + 1 as 
well. □ □ 

Theorem 8. Given n = 2m, let the RBS Y be defined as in Theorem\Q and let a 
be a randomly chosen permutation of {1, . . . , n}. Then W — o~(Y) defines a RBS 
with all of M(n) as support. 

Proof. For any w € M(n) using the a from Lemma w — ct~ 1 (z) where all the 
partial sums of itT^ lie in [—1,1]. The proof of Theorem showed that any such 
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FIGURE 6. The fact that the rhombic dodecahedron does not fill 
up all of the octahedron 2M(4) reveals the need for symmetrization 
in order to generate a RBS supported on all of M(8). 



z could occur as a RBS Y defined in H2.2I Thus w can occur as t(Y) for the 
permutation t = o~~ . □ □ 

2.4. Odd sample sizes. For odd sample sizes, n = 2m + 1 we can likewise con- 
struct an algorithmically efficient degenerate RBS with two disjoint maximal sub- 
sets of mutually independent variables. This can be done by choosing Xi, . . . , X m 
(with each X k uniform in [—1,1]) and B G {—1,1} independently, and then setting 

X m+k = -X k (k = l,...,m- 1) 

X2m = — — (X m + B) 
X2m+1 = ~2^ m ~ ^ 

It is easy to check that this gives a RBS. As in the even case, we can make this 
distribution robust by redistributing pairs of the variables. 

Theorem 9. Let X\, . . . , X m , Ti, . . . , T m be mutually independent random vari- 
ables, each uniform on [—1, 1] and B be a discrete random variable, uniform on 
{ — 1, 1}. Assume m > 1 and let 

S k =Xk- X k+ i, {k = 1, ... to 2) 
S m -i — X m ^i — 
S m = — — (X m — B) — X\ 
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Next redistribute the variables as in Lemma^ 

Yok-i = y + f 1 ~ ^r) Tfe > (k = l,...,m) 
Y 2k = ^-(l-^T k (* = l,...,m) 

Yzm+l = -^m 

TTien y is an o/ size n = 2m + 1 that is maximally robust. That is, it has a 
set of dimension n — 1 as its support. 

Proof. First note that — \{X m + B) and — \{X m — B) are uniformly distributed 
on [—1,1]. As a result, for each k, Sk is the sum of two independent uniform 
variables, and by Lemma 0] each coordinate of Y is uniform on [—1,1]. Also, 
ELi Y k = El'Li S k + Y 2m +i = -\{X m +B)- i(X m — B) + X m = 0, so Y does 
indeed define an RBS. 

To see that this RBS is maximally robust, we will show that every y € M(n) with 
sufficiently small coordinates can occur as Y. If \yk\ < for each k, construct x 
and t as follows. Fix b € {—1, 1} and let r^ = i/2fe-i + V2k for k = 1, . . . , m, and let 

m 
i=l 

m— 1 

xk = x fam + &) + ^ n, (fc = 1, . . . , m - 1). 

i—k 

Then for k < m, Xk — l(X)i=i( — l) ai ri+b), for some choice of a^. So since |ffe| < 
|arfc| < 1 for all k. Also for k < m — 2, 

Sfe ^fc ^fc+1 — ^m- 1 — Xm — i ^(*^m ~l~ ^) — ^m— 1) 

and s m = — — (x m — 6) — xi = — x m 

Thus we have a .t G [— l,l] m such that the corresponding Sk = r^ for each k. 
We can then solve for tk € [—1,1] such that Y 2 k-i — Vih-x an d Y~ 2 fc — J/2fe- Also 

2m+l = a^m = - 2-^=1 1~i = - l^k=l Vk = 2/2m+l- 

Finally consider the neighborhood y(e) of y in M(n); we must show that the 
procedure of the theorem places sampling vectors in y(e) with positive probability. 
By the continuity of the procedure there is a neighborhood u(8) of (x, i) in [—1,1]" 
that is mapped by the procedure into y(e). The probability of u(5) is already 
positive; indeed it's just the normalized n-volume, since the coordinates are chosen 
independently and uniformly in [—1, 1]. □ □ 

Note that the procedure above will yield a y <E M(n) if and only if Xf. = 
Ei=ik-i Hi + \ {Vn + b) lies in [—1,1] for k = 1, . . . ,n— 3. By symmetrizing the pro- 
cedure and proving the following lemma which is slightly stronger than necessary, 
we will get a RBS that fills all of M{n). 



rn— 1 

^ T% — Trn ■ 
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Lemma 10. Given any w G M(n) where n is odd, there exists a permutation a of 
{1, 2, . . . , n} and a b G { — 1, 1} such that for z = o~(w) := (tu^m, . . . , W a <\\) all of 
the sums 

n— 3 

(14) ^z, + -(z„ + 6) (fe = l,...,n-3) 

Zie m [—1,1]. 

Proof. Choose cr(n) such that z n = wv(n) > 0. Let b — —1. Let a = — \{z n + 6) G 
[0, 4]. The condition now becomes 

n— 3 

:= 2_, %i € [—1 + a,l + a] (k = 1, . . . ,n — 3) 

Taking S'„_2 = we will prove the above condition by induction. Assume o~(n — 
3), . . . , cr(fc) are defined such that Sk € [—1 + a, 1 + a]. 
If 5fe G [a, 1 + a] then J2*i=k w <y(i) + w<r(n) > 0. Since 

n-3 

2J W a (i) + «V( n ) + J! Wl = 

i=fc i^cr({fc,...,n— 3,n}) 

we can choose cx(&; — 1) such that uv(fc_i) < 0. Then Sk-i G [—1 + a, 1 + a]. 

IfSfc-i G [— l + a,a] then if possible choose a(k— 1) such that u><j(fc_i) > 0. Then 
Sk-i G [— 1+a, 1+a]. Otherwise choose any a(k—l) and Wi £ a({k—l, n— 3, n}), 
< 0. So S"=fc-i ^o-(i) + ^o-(n) > and thus S fe _i > -w a{n) = -1 + 2a > 
-1 + a. Therefore 5fe_i G [-1 + a, 1 + a]. □ □ 

Theorem 11. Given n = 2m + l, let the RBSY be defined as in the above theorem 
and let a be a randomly chosen permutation of {1, . . . , n}. Then w — o~(Y) defines 
a RBS with all of AI(n) as support. 

Proof. The proof follows from Theorem [!|] and Lemma I1UI in much the same way 
as Theorem [S] followed from Theorem and Lemmad That is, for any w G M(n) 
using the a and b from Lemma ITU1 w = cr _1 (z) where all the sums in (|14f) lie in 
[— 1, 1]. The proof of Theorem [5] showed that any such z could occur as a RBS Y 
as defined in the statement of that theorem. Thus w can occur as t(Y) for the 
permutation t = o~~ . □ □ 

3. GEROW-ROBSON DENSITIES 

We are interested in probability densities on M(n) with respect to the measure 
defined as follows. Let 1 denote the vector (1,1,..., 1) G R", and let 1 
denote the hyperplane consisting of all vectors perpendicular to 1. For each Borel 
subset S of P-, let v n ^i(S) denote the (n — l)-dimensional Lebesgue measure of 
S, regarding 1 as an isometric copy of R n_1 . Let V n -i = v n -i(M(n)). 

By a Gerow-Robson (G-R) density on M(n) we shall mean a probability density 
h with respect to v n -\ such that 



(15) 



h{X) = f{\\X\\ 00 ) (XeM(n)) 
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for some / : [0, 1] — * [0, oo), where 

(16) HXIU = max \X k \. 

l<k<n 

We remark that on M(3) and M(4) the subspaces with constant oo-norm are rep- 
resented by hexagons and octahedra respectively. This definition is suggested by 
the densities discovered by Robson and Gerow for the cases n = 3,4. They noted 
that with f(s) proportional to s, ie with /s(s) = C^s and fi(s) = C^s for cer- 
tain constants Cfc, the corresponding G-R densities define RBS distributions on 
M(3),M(4), respectively (see Figure^]). These facts will also follow from the gen- 
eral analysis of G-R densities given below. It was natural to try to extend this 
construction to larger values of n. We'll prove below a rather surprising fact: not 
only do densities with /„(s) = C n s fail to generate RBS distributions when n > 4, 
but (at least for n < 250) no other choices of f n yield RBS distributions. Con- 
cisely, it seems that the only G-R densities that yield RBS distributions are those 
discovered by Robson and Gerow. 

As a historical note, Gerow was already aware via numerical simulations reported 
in ^D] that f(s) = C$s did not yield a RBS distribution. In this was verified 
theoretically and it was shown that, in fact, no choice of /„ gives a RBS distribution 
for n = 5. 

Here we will extend this result to larger values of n. For n > 6 we will derive a 
sufficient condition for the nonexistence of a G-R density on M(n). This condition 
can be verified numerically for n < 250. We will also show that the densities given 
by Gerow and Robson are the only such densities on M(3) and M(4) and we will 
reprove the nonexistence of a G-R density on M(5). 

We conjecture that the condition we will derive for the nonexistence of a G- 
R density on M(n) holds for all n > 6. While we are unable to prove this, our 
computational results settle all cases likely to be of practical interest. For very large 
samples sizes, constructing random balanced samples is perhaps less important, 
since for large values of n a uniform sample from [—1, 1]™ is, in view of the law of 
large numbers, likely to be very nearly balanced. 

For any probability measure P on M(n) we have the corresponding distribution 
function defined on [0, 1] by 

(17) G(r) = P{rM(n)) = P{{X e M(n) : \\X\\ X < r}). 
In those cases where G has a density on [0, 1] we denote it by g: 



(18) G{r) = P{rM(n)) = g(s)ds. 

Jo 

If P is defined by a G-R density h corresponding to /, we must have 

(19) G{r)= ^ f( S )(v n ^( s M(n))Yd S , 

Jo 

and since v n -i(sM(n)) = s n - 1 v n -i(M(n)) = s n ~ 1 V n -i, 

(20) G{r)= f f( S )(n-l)V n - lS n - 2 ds. 



o 



Thus, g(s) = (n — l)V n -iS n ~ 2 f(s) for a G-R density. This relation, along with 
the fact that G(l) = 1, allows us to properly normalize /. For example, in the 
cases treated by Robson and Gerow we have f n (s) = C n s\ normalizing g we see 
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that g n (s) — ns n . Thus the normalizing constants C n are given by ( - K _ 1 "y — -• A 
little computation reveals that V2 — 3\/3 and V3 = 32/3, so that, in our models, the 
densities found by Robson and Gerow correspond to /^(s) = s/2v3 and fi(s) = s/8. 

Conversely, given any probability density g on [0, 1], we shall see how to construct 
a probability measure on M(n) with the corresponding G-R density: for X € M(n), 



(2D h(x) = f(\\x\\ 00 ) = f^H _ 2 - 

It is natural to ask for which g and for which n we obtain RBS distributions. The 
answer, as claimed above, is given by the following result. 

Theorem 12. For n < 250, the G-R density on M(n) corresponding to a density 
g on [0, 1] defines a RBS distribution in exactly two cases: n — 3 with 33(5) = 3s 2 , 
and n = 4 with 34(3) = 4s 3 . 

Before turning to the proof of this theorem, we introduce a specific model for 
the distribution of X 6 M(n) according to the G-R density implied by a density g 
on [0, 1]. Consider auxiliary random variables Y\ and Z2, ■ ■ ■ ,Z n defined as follows: 
< Yi < 1 has the given density g, and (independently) (Z 2 , ■ ■ ■ , Z n ) is uniformly 
distributed with respect to the (n — 2)-volume on 

n 

(22) {ZeKir'^z^-i}. 

fc=2 

Let Yfc = YiZ k for k = 2, . . . ,n, so that Yi > \Y k \ and ELi r fe = °- Then 
X = Y yields sample points in that part of M(n) where ||-?||oo = -X'l- Finally, we 
symmetrize over M{n): 

(23) X = ±(Y <7(1) ,...,Y a(n) ), 

where the sign ± and the permutation a are chosen at random. 

Proposition 13. Relative to (n — l)-volume on M(n), this X has a density of the 
G-R form JUJ). 

Proof. It suffices to consider a point X 6 M(n) where 

II^Hoo = Xx > \X k \ (k = 2,...,n), 

and the behavior as At — > of the probability P(X + AtM(n)) that a sample point 
falls in the neighborhood X + AtM(n). Denote this probability, briefly, by P(At). 
Then 

P{At) = P{± = +}P{o-(l) = l}P{Y k e [X k -At,X k + At] {k = 1, . . . , n)}. 

For small values of At, P{Yi € [X\ - At, X\ + At]} is close to 2Atg(X 1 ), and given 
Y\ the probability 

P{Y k e[X k -At,X k + At] (k = 2,...,n)} = 
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is nearly proportional to (At/ Xi) n ~ 2 , since Y\ ss X\ and the hyperplane {^2 = 
— 1} makes the same angle with each coordinate axis. Thus P(At) is nearly pro- 
portional to 

\ ■ - ■ 2Atg(X 1 ) ■ (At/X^- 2 
I n 

as At — > 0. It follows that the density of P relative to (n — l)-volume, ie 

urn — — t— , 

At|o (Aty- x v n - x 

is proportional to 

g[Xx) _ ffO^U) 

xr 2 \\x\\^ 2 ' 

We have, therefore, a G-R density with /(s) = Kg(s)/ s™~ 2 , for some constant 
K, and we must have ||2U, ie K = l/(n — l)V^_i, because of the general relation 
between / and g for G-R densities. □ □ 

To see whether a given density g on [0, 1] generates a RBS distribution on M(n) 
we must examine the values of P{Xk G [a, b]} in the model defined above. It is suf- 
ficient to compute P{Xi G [0,i]} since the coordinates X^ have been symmetrized, 
ie they are interchangeable and — X/, has the same distribution as X^. Thus, the 
sampling vector X will have a RBS distribution on M(n) exactly when 

(24) P{Xl e[o,t]}= t - (te[0,l])« 

Recalling the equidistribution of Y%, . . . , Y n , we compute: 

p{x 1 e [0, t]} = P{± = +}P{a(i) = i}P{Y 1 e [0, t]}+ 

P{± = +}P{a(l) ± 1}P{Y 2 G [0, t]} + P{± = -}P{a(l) + 1}P{Y 2 G [-*, 0]} 
= ^ / 9(8) ds + ~(1 - -)P{Y X Z 2 G l-t, t}} 

= f g{s)ds + \(l--) f g(s)P{Z 2 e [—,-]} ds. 

2n J 2 n J s s 

Let P n {f) = P{\Z 2 \ < r}. Since P n (t/s) = 1 when s < t, our condition for a RBS 
distribution becomes: for all t G [0, 1], 

(25) \ = \ ^ g(s) ds + 1(1 - 1) J 1 g(s)P n ( t -) ds. 
Differentiating with respect to i, we obtain the condition: 

(26) 1 = -g(t) + (1--) f g{s)P' n {-) - (t G [0, 1]). 

n n J t s s 

A little geometry reveals that Ps{r) = r and Pi{r) = r, so that we may verify 
that H26fl is satisfied with n = 3, gs(s) = 3s 2 and with n — 4, g 4 (s) = 4s 3 . This 
is one way to verify the discoveries of Robson and Gerow that f n (s) = C n s yields 
RBS distributions for n = 3, 4. By a more involved geometric argument one may 
obtain Ps(r) = (24s — s 3 )/23 and see that 35 (s) = 5s 4 does not satisfy (with 
n = 5), ie that the most "natural" generalization of the constructions of Robson 
and Gerow do not yield a RBS distribution for sample size 5. To use lf27i)) more 
systematically, we must first find some general expressions for P' n {r) 
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Note that the (n — 2)-volume on M x [—1,1]" 2 n{5^2 %k — —1} can be sampled 
uniformly by choosing Z%, . . . Z n independently and uniformly on [—1, 1] and then 
setting Z 2 = -1 - £3 Z k . Thus the (n - 2)-volume on [-1, l]"- 1 n Z k = -1} 
can be sampled uniformly by repeating the above procedure until Zi G [—1,1]. 
That is, until £3 Z k S [0, 2]. Thus, 

(27) P.(.) := n\Z,\ < ,) - ^^g.^;^.;"" (8£ 

where the are now independent and uniform over [—1,1]. Let (f> be the uniform 
density over [—1, 1], ie <j> = ^/[-i^j. Then the density <p n of ^3 Z k is the (n — 2)- 
fold convolution of </> with itself. Note that 4> n is an even function. It follows that, 
for s £ [0,1], 

f, 1+S d> n (u) du 

and that 

6„(1 + s) + 0„(1 - s) _ </>„(! + s) + 4> n (l - s) 



KM = 



^4> n {u)du 20 n+1 (l) 



These functions, for s £ [0, 1], are polynomials in s. One way to see this, and to 
obtain explicit expressions for P^(s), is to compute in terms of Laplace transforms. 
By the Laplace transform £ip( s ) °f a function ip(t) (with left-bounded support) we 
mean 

/oo 
e~ st ^(t)dt (s>0), 
-00 

We shall use several well-known properties of C; for example, C converts convolu- 
tion products ipi * ip2 into ordinary pointwise products: 

£{-0i*V>2}(s) = Cipi(s)Cip 2 (s). 

Since C<p(s) — (e s — e~ s )/2s, it follows that 

(e s - e~ s ) n - 2 



C(j) n {s) 



>n— 2 c n— 2 



Now e rs /s n - 2 = C{(t + rYl- 3 /(n-3)l}(s), where t+ denotes Jf(f)t, Jf(t) being the 
Heaviside function. Since C is injective, 



(28) ^)= (n _3 )!2 n- 2 E( )(-l) fe (* + n-2-2 fc) 



In evaluating P n (s) (for s S [0,1]), we need only apply i(2*5|) for t e [0,2] and we 
obtain 



n-2 f 



k=0 



(29) K(s)=CnJ2[ , K-l) fc {("-l-2fc + .s)r 3 + (n-l-2fc- S )r 3 }, 



where C n = l/((n - 3)!2«- 1 n+1 (l)). 

Let us consider the case of even n, say n = 2to. Then in (|29|) . 2m— 1 — 2fc±s < 
if and only if k > m (always with s£ [0, 1]). So 

(30) P^s) ( 2{m , 1) )(-l) fe {(2m-l-2fc+ S r- 3 +(2m-l-2fc- S )"- 3 }, 

fc=0 ^ ' 
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The odd case is the same but with one additional term. In both cases, the odd 
powers of s cancel, and the polynomial has the form 

(si) ^(s) = a£c 3> ^, 

3=0 

for certain coefficients Cj.„, where a n — \_(n — 3)/2j. These coefficients may be 
evaluated explicitly in any specific case, by reference to (|29[) : later we'll take a 
somewhat different point of view to derive some general properties of the Cj jTl . 

Proposition 14. The G-R density corresponding to a probability density g n on 
[0,1] defines a RBS distribution on M(n) exactly when 

n 

(32) Cqnis) = s(l + (n-l)LQ n{s) y 

where q n (t) = H(t)g n (e-*) and Q n (t) = ff(t)i* (e"*). 

Proof. The condition 1261) . with g = g n , may be rewritten in the form 

f 1 e~* ds 

n = q n (t) + (n-l) g n ( s ) P ^ — ) — ( t > 0). 

J e -t s s 

With the change of variable s = e~ u , this becomes 

n = g n (t) + (n - 1) / q n (u)P^e-^)du (t>0), 



le 

/oo 
q n (u)Q n (t ~ u) du, 
-oo 

ie 

nH = q n + (n - l)q n -k Q n . 
Applying the Laplace transform we obtain 
n 

- = Cq n (s) + (n - l)Cq n (s)CQ n (s). 
s 

□ □ 

Now in terms of l|3I|) we have 

Ctn. 

Q„(t) =H(t)C n J2c j , n e- 2jt > 

3=0 

so that 



s + 2 j 

From Proposition^Jit follows that, in order for g n to generate a RBS distribution, 
we must have 

(33) Cq n (s) = n/(s(l + (n - 1)C„ £ -^)). 

Using this condition one can check that the densities discovered by Gerow and 
Robson are the only G-R densities for n = 3,4. Indeed evaluating i|33|l . Cq 3 (s) = 
jjp2 and £qi(s) = Thus, since C is injective, q 3 (s) = 3e _2t and q 3 (s) — 4e _3t . 
So g 3 (t) = 3t 2 and g^t) = At 3 . 
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Continuing, using (|33[) one can check that £05 (s) = 23s*+ 5 i30s+i92 an< ^ thus 
q B (t) = T | T e-i*(191cos(^it) - 19v / 197sin(^±IXi)). However g 5 (1.5) < 0. This 
contradicts the fact that q(t) = g(e~ t ) and g is a probability density on [0,1]. 
Therefore there is no G-R density on M(5). 

We can generalize this non-existence result to higher n without explicitly calcu- 
lating Cq n and q n . 



Let 



B n (s)=[l[(s + 2j)) (l + (n-l)C n £-^, 



\j=0 J \ 3=0 

Then B n (s) is a polynomial of degree a n + 1 and 

tin (a) = . 

B n {s) 

Assume that B n (s) has a n + 1 real distinct roots ao > ai > . . . a Qji . Then for 
some k ^ and some {6-, ^ 0}, 

^ng 1 ( g+ 2j ) _- bj 

Since £ is injective, 

q n (t) = J2b j e a * t . 

3=0 

Since ao > a\ > . . . > a a „> f° r large values of t the sign of q n (t) equals the sign 
of bo- Since g is a probability density, q(t) > for all t > 0. So 6o > 0. We have 
shown the following. 

Theorem 15. Let n > 6. If B n (s) has a n + 1 distinct real roots and bo < i/ien 
f/iere does woi exist a G~R density on M[n). 

With a few hours of computation on a desktop computer using a computational 
program such as Maple one can verify that B n (s) has a n + 1 distinct real roots 
for 6 < n < 250. We remark that we can show if B n (s) has a n + 1 distinct real 
roots then — 3 < ao < — 2 and bo < 0. However, to prove Theorem 1121 we do not 
need to use this fact. The second condition can be verified computationally for 
6 < n < 250. For example, for a given n one can check that —3 < a < —2. Since 
for all s S (—3, —2), the sign of B n (s) is the opposite of the sign of Cq n (s), it follows 
that b„ < if and only if B n (-3) < and B n (-2) > 0. This proves Theorem [HI 
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